A Sequential Algorithm for Training

نویسنده

  • David D. Lewis
چکیده

The ability to cheaply train text classiiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classiiers was developed and tested on a newswire text categorization task. This method, which we call uncertainty sampling, reduced by as much as 500-fold the amount of training data that would have to be manually classiied to achieve a given level of eeectiveness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SEISMIC DESIGN OPTIMIZATION OF STEEL STRUCTURES BY A SEQUENTIAL ECBO ALGORITHM

The objective of the present paper is to propose a sequential enhanced colliding bodies optimization (SECBO) algorithm for implementation of seismic optimization of steel braced frames in the framework of performance-based design (PBD). In order to achieve this purpose, the ECBO is sequentially employed in a multi-stage scheme where in each stage an initial population is generated based on the ...

متن کامل

Automatic measurement of instantaneous changes in the walls of carotid artery with sequential ultrasound images

Introduction: This study presents a computerized analyzing method for detection of instantaneous changes of far and near walls of the common carotid artery in sequential ultrasound images by applying the maximum gradient algorithm. Maximum gradient was modified and some characteristics were added from the dynamic programming algorithm for our applications. Methods: The algorithm was evaluat...

متن کامل

A new stochastic 3D seismic inversion using direct sequential simulation and co-simulation in a genetic algorithm framework

Stochastic seismic inversion is a family of inversion algorithms in which the inverse solution was carried out using geostatistical simulation. In this work, a new 3D stochastic seismic inversion was developed in the MATLAB programming software. The proposed inversion algorithm is an iterative procedure that uses the principle of cross-over genetic algorithms as the global optimization techniqu...

متن کامل

OPTIMAL DESIGN OF DOUBLE LAYER GRIDS CONSIDERING NONLINEAR BEHAVIOUR BY SEQUENTIAL GREY WOLF ALGORITHM

The present paper tackles the optimization problem of double layer grids considering nonlinear behaviour. In this paper, an efficient optimization algorithm is proposed to achieve the optimization task based on the newly developed grey wolf algorithm (GWA) termed as sequential GWA (SGWA). In the framework of SGWA, a sequence of optimization processe...

متن کامل

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

A Random Forest Classifier based on Genetic Algorithm for Cardiovascular Diseases Diagnosis (RESEARCH NOTE)

Machine learning-based classification techniques provide support for the decision making process in the field of healthcare, especially in disease diagnosis, prognosis and screening. Healthcare datasets are voluminous in nature and their high dimensionality problem comprises in terms of slower learning rate and higher computational cost. Feature selection is expected to deal with the high dimen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994